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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Fred Hutchinson Cancer Research Center, Inc. 

(B) STREET: 1100 Fairview Avenue North, Mailstop C2M-027 

(C) CITY: Seattle 

(D) STATE : Washington 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 98109 

(A) NAME: Thomas Spies 

(B) STREET: 2429 E. Aloha 

(C) CITY: Seattle ' 

(D) STATE: Washington 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 98112 

(A) NAME: Veronika Spies 

(B) STREET: 2429 E. Aloha 

(C) CITY: Seattle 

(D) STATE: Washington 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP): 98112 

(ii) TITLE OF INVENTION: CELL STRESS REGULATED HUMAN MHC CLASS 
(iii) NUMBER OF SEQUENCES: 16 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 (EPO) 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/029,044 

(B) FILING DATE: 29-OCT-1996 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11722 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
CACTGCTTGA GCCGCTGAGA GGGTGGCGAC GTCGGGGCCA TGGGGCTGGG CCCGGTCTTC 
CTGCTTCTGG CTGGCATCTT CCCTTTTGCA CCTCCGGGAG CTGCTGCTGG TGAGTGGCGT 
TCCTGGCGGT CCTCGGCGGA GCGGGAGCAG TGGGACGTTT CCGGGGGTCG GGTGGGTAGC 



GGCGAGCGCT GTGCGGTCAG GGCGGGGCTC CTGTGCCCTG TCGGTGGCGC AGGGAGCTGG 
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ACGCGGCCCG TTACCGCCAC ACTTCAGCCC 
GGATCGCGCA TCACCTGCAC TTTCTGGTCT 
CCGCTTCCTC TCACTTTTCG GACAAACCAG 
CCTCCGGGGC TGCTCCTGTG AATGGCATTC 
GCCACAACCC CCGGTGCTCX3 GGGCGGCTCT 
GCCGACGTGT TCTCAGCTCC TGGGCCGCAG 
ACCCGGAGCT GGTGCTTCCT GCTGCTGTGG 
AGGGGTTGGA GTCTGCAGGC TCCGGGCAGA 
GCTCATTCTC CCCTC6CCTC TGGCTCCGAG 
CTAATGACCA GTGATCTAAG GACACCAGAT 
GGCCCGCTGA GTCCTTTTGC CCTCCCA6CT 
TGATCCATTT CTAGGGTGTC CTCTGCCCTC 
TGCACCCCTT ATGGGCCTTT CCTACAAGCA 
CCGTTCCCAA AT6TCCCTGA CTCTAACTTT 
CCCTCCATCC CACTCCCCTC CAGACCCCCA 
CCTTAGGCAG GGCGCAGGGC AGCGCAGATG 
TGGACCCTGC CTCATTGTGG CCCCTTCCCC 
GGACCCCACC CTCCAGCCCA CAGGTGCTGG 
CCACCTTGGG ACCTTGTGCT GCTCCTATCT 
TTCTCATCTT TCAGTGGGAA AGTGGGAGTG 
CCCAAGCCCC ACCCTCCCCC AGAGCACCCT 
TCCCACAGTC ACTCCTGCCC CATGCTCATG 
CCCTCCCCAA CCCAGACCTA AAACAGGCTG 
TTTCTTTTGG TTCCTTGACC CCAGTGGGCT 
GTTTTGCCTG CTCTTGGGGT GCCACTGCTC 
CTTCCTCAGG CTGAGAATCT CCCCCTCTAC 
ACTCCTTGCT TTGTTTTTAA TTTTTAACTT 
TATATTTATG GGGTACATGG GATATTTTGA 
AGGGTAAATG GGTTATATCA CAACAAGCAT 



TGCTTCCCCG TCACTTTTCA GTCCTCCTCG 
CCTCCTGCTC TTTCTCTCCT CGCGTCTCCT 
TCCTTCTGAG GCCCATGGGT TCCCGGGCTG 
GAGTGCCCTT CCAGCX3CGGC CACTGAAGCA 
CAGGTCCCTG AAGTCCTGTC CTCTCCCGGA 
CTCCTGGAGT AGGGGCCCTC CTTTCTCGGG 
GGACT6TGGG. GGGTCCTGAC TCTCAAGCTGL 
GGATTCTTCC TGCGACTTCT CTCATCCCCA 
GGTCCTCTCC TCTCTCTCAT CCCACCCCTA 
TCCCTCTCAC CTCCTCCCTG CCCATCTCAG 
CCCTGCTACC CCTTCCTGTG TGCT6TTCTC 
ATCCCCTGTC CCCGCCACCG AAGTCCCTCC 
GCCTTCACCC AGTGCTGCCC CTATGCCTCC 
CTGGTGCTGC CTTTTATCCG GGGGGGTCTT 
AGGGGAACCC TGATGCTAAT GGCAGTTGGG 
CCCCCTCCCC TCCAGTGCAG ATGCCTGTTC 
ACTCCTTCAT CCTCAGCCTC ACCCTCTTGA 
ACCATCCCTC CCTGGTCCCT CCGCCCCTCT 
CTTGCCCAGC TGCCTTGGGC CCTCAGCACG 
CTGGAGCATA TGACAGTGCT GAGCATCTTT 
CCCCTCCTGT CCTCACCCTA CCCCAAGTTC 
CCGCCCTCCA GTTCTTGCTC TGCCCATCTC 
TTGGGCCAAC TGTTCCTTGA CCTTCCTTCT 
CTCACTCCCC ACACCGCATA TCTAAAATCT 
CCCCTCCAGC ATTACTCCTT TTGGCAGGTC 
CTTGGTTTTC TCTCTCTGGC CAGCACCCCC 
TTGTTTGGGT ACGTAGTAGA TATATATGTA 
CACAGGCCTA CAATATGTAA TAATCACATC 
TTATCCTTTC TTTGTGCTAC AAACAATCCC 
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ATTATGCTCT TTCAGTTATT TTTAAATGTA CAATAAATTA TTGTTGACTG TACTCACCCT 2040 

GCTGTGCTAT CTACTAGATC TTATTCATTC TAATTATATT TTTGTACCCA TTATTAACCA 2100 

TCCCTGCTCC CCCACTCCCC ACTACCCTTC TCAGCCTCTG GTAATCATCA TTCTATTGTC 2160 

TCTCCCCATG AGGTCCATTG TTTTAAATTT TGGCTGCCAC AAATAAGTGA GAACATGCAA 2220 

AGTTTGTCTG TCTGGGCCTG GGGCTTATTT CACTTCACAG GATGACCTCC AGTTCTTTGC 2280 

AAATGACACG ATGGCTGAAT AGTTCTCCAC ATACACATGT ACACCACATT TTCTTTATCC 2340 

ATGCGTCTGT TGATGGACAC TTAGATTGCT . TGCAGAXCTX .GGCTACTTTG..AATAGTGCTG 24D0 

CAATAAACAT GGAAAAGTAG ATAGCTCTTT AATATACCGA TTTCCTTTCT TTGGAGTATA 2460 

TGCCTAACAG TGGGAGTGCT GGAGCATATG ACAGCTCTAT TGTATTTTTA GTTTTTGGAA 2520 

GAACCTCCAC ATTGTTTCCC ATAGTGGTTG TACTAGTTTA CGTTCCCACC AACAGTGTAC 2580 

ATCCTCACCA GCATTCCTTA TTTCTACATC CTCGCCAGCA TTCCTTATTG CCTGTCTTCT 2640 

GGATAAAAGC CAGTTTATCT GGGGTGGGAT GTTATCTCX5T AGGAGTTTTG ATTTGCCTTC 2700 

ATCTGTTGAC GAATGATGTT GAGCACCTTT TCATATACCT GTTTGCCATT TATATGTCTT 2760 

CTTTTGAGAA ATGACTATTC AGATCTTTTC TCATTTTTAA ATTGGATTAT TATATTTTTT 2820 

TTCCTATAGT TGTTCGAGCT CCTTATATGT TTCAGTTACT GATCCTTTGT CAGATGAATA 2880 

GTTTGAAAAT ATTTTCTCCC ATTCTTGGAT GGTCTCTTCA TTTTGTTTAT TGTTTCCTTT 2940 

GCTGTGCAGA AGCCTTTTTA CTTGATATGA TCCCATTTAT GCAATTTTAC TTTGGTTACC 3000 

TGTGCTTGTG GGGTATTACT TTAAAAATCT TTGCCCAGTC CAATATCCTA GAGAGTTTCC 3060 

CCAATGTTTT CTTGTATAGT TTCATAGTTT GAGGTCATAG ATTTACATCT TTAATCCACT 3120 

TTGATTTGAT TTTTGTATAT GGTGAAAGAC AGGGTCTAGT TTCATTCTTC TGCATAAGGA 3180 

TATCTAGTTT CCCCAGCACC ATTTTTGAAG AGACTCTCCT TTGCCAATGT GTGTTCTTGG 3240 

TACCTTTGTT GGAAATGAGT TTACTGTAGA TGTATGGAAT TGTTTCTGGG TTCTCTATTC 3300 

TGTTTCATTG GTCTGTGTGT CTGTTTTTAT GCCAGTATCA TGCTGTTTTG GTTACTGTAG 3360 

CTCTGTAGTA TAATTTGAAG TCAGATAATG TGATTCCTCT AGTTTTGTTC ATTTTGCTCA 3420 

GGATAGCTTT ATCTATTCTG GTTTTTTTGT GGTTCCATAT GCATTTTAGG ATTATTTTTA 3480 

TTATTTCTGT GAAGAATGTC ATTAGTGTTT TGATAGGGAT TGCATTGAAT CTGTAGATTA 3540 

CTTTGGGTAG TATGGATATT TCAACAAAAC TGATTCTTCC AATCCATGAA CGTGGACTAT 3600 

CTTTTCCATT TTTTGTGTCC TTCAATTTTT TGCATCAGTG TTTTTTGTTT TTGGTTTTTG 3660 

AGATGGAGTT TCACTCTTGT TGCCCAGGCT AGAATGCAAG GGTGTGATCT TGGCTCACCG 3720 
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CAACCTCCGC CTCCCAGGTT CAAGCTATTC TTCTGCCTCA GCCTCCCAAG TAGCTGGGAT 3780 

TACAGGCATG TGCCACTGTG CCTGGCTAAT TTTCTATTTT TATTAGAGAT GGGGTTTCTC 3840 

TATGTTGGCC AGGCTAGTCT TGAACTCCTG ACCTCAGGTG ATCCACCTGC CTCGGCCTCC 3900 

CAAAGTGCTG GGATTACAGG CATGAGCCAC CACGCCCAGC CACATCACTG TTTTATAGTT 3960 

TTTATTGGAG AGGTCTTTCA CTTCTTCAGT TAGGTTTATT CCTCAGTATT TTATTTTATT 4020 

TGTAGCTATT GTAAATGGGA TTCGTTTCTT GATTTCTTTT TCAGATTATT TGCTGTTAGC 4080 

ACTGATTTTT GCATGTTGAT- TTTGTATCCT- -GCAACTTTAC TGAATTTGTT CTTCAGTTCT 4140 

AATGGTTTTT TGGTGGAGTC TTTAGGTTTT TCCAAATATC AGACCACATG ATCTGCAAAC 4200 

AAGGATAATT TGACTTCTTC TTTTCCAGTT TTAATGCCCT TTCTTTCTTT CTCCTGTCTG 4260 

ATTGCTCTAG TTAGGATCTG CAGTACTGTG TTGCATAACT GTGGTAAAAT TAGTCATCCT 4320 

TGTCTTATTC CAGATCTTAG AGAAAAGGCT TTCAGTTTTC CCCCATTCAG TATGTTACTA 4380 

GCTGTGAGTT TGTCATATAT GGCTTTTATT ATATTGAGGT, CTGTTCCTTG TATACTTAGT 4440 

TTTTTGAGAG TTTTTATCAT GAAGGGATGT TGAATTTATC AAATGCTTTT TCAGTATCAA 4500 

TTGAATGATA CTGGCTTTTG TCCTTTATTC TGTTGATATG ACGTATTACA TTGATTGATT 4560 

TGTGTATGTT AAATCATCCT TGCATACCTG GAATACATTC CACTTGCTCA TAAAGAATGA 4620 

TCTTTTTTAA TGTATTGTTG AATGTGGTTT GCTAGTATTT CCTTGACGAT TTTTGCATCG 4680 

GTGTTCATCA GGGATATAGG CCTGTAGTTT TCTTTTTTAT GATGTGTCTT TGCCTGGTTT 4740 

TTGTATCAGG ATATTCCTGG CTTTGTAAAA TGAGTTTGGA AGTATTCCCT CCTCCTCTAT 4800 

TTTTCAGAAC AGTTTGAATA GGACTGACAT ATGTTGTTCT TTAAAAGTTT AATTGTGGTA 4860 

AATTATACAT TACATAAATT TTACTGTTTT AACCACTTTT AAGTGTATAC TCX3GTGGCAT 4920 

TAGATACATT CACATTTTTG TGCAACCCAA AACTCT6TGC CCATTAATCG GTAACTCCCC 4980 

ATTCCTCCCT ACCTCTGGCC CCTGGTAACC ACCATTCTAC TTTTTGTTTC TATGAATTTG 5040 

ACCACTCTAG GTACCTCATT TAAGCAGAAT CATGTAATGT TTGTCTTTTT GTTTCTGGCT 5100 

TATTTCACTT ATAATATTTT TGAGGTTCGG TGGGCACAGT GGCTCACGCC TGGATTTCCA 5160 

GCACTTTGGG AGGCTGAAGC AGGTGGATCA CCTGAGTTTC GGAGTTCGAA ACCAGCCTGG 5220 

CCAACATGGT GAAACCCCAT CTCTACTAAA AATAATAAAA GTTAGCCGGG CGTGATGGCG 5280 

GGTGCCTGTA ATCCCAACTA CTTGGGAGGC TGAGGCAGGA GAATCGCTTG AATCCGGGAA 5340 

GTGGAGGTTG CAGTGAGCTG AGATCAGGCC ACTGCACTCC AGCCTGGGCA ACAAGAGTGA 5400 

AATTCCATCT CCAAAAAAAA AAAATAAAAC AATAATAATA ATAATATTTT TGAGGTTCAT 5460 
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CCAAGTTGTA GTATGGGTCA GAATTTCATT CCTTTTJ\AGG ATGGATAATA CTCATTATAT 5520 

GTATGTACCA CATCTTGGTT ATCCATCCCT CAGACAATGG ACACTTGGGT TACTTCTACC 5580 

TTTTGGATAT TGGCAAATAT TTCATTTCCT TTGGGTATAT ATTTATTTCC TTTGGGTATT 5640 

TCTTTTGGGT ATATATCCAG AAATAGAAGC AGTACACAGG GGCTTCATTT TCTCTGTCTC 5700 

TTTGCCAACC TTGCTCTGTG TGTGTGTGTA TGTGTGTGTG TAGGTGTGTG ATAACAGCCA 5760 

TCCTGATTGG TTTCAGGTGG CATCTCATTG TGGTTTGGAT TTGCATTTTC CTAATGAGTG 5820 

CTGATATTGA GCATCTTTTC ATGTGTTTGT TGAXCATTTG_.TAATTCTOT; .TGAAGAAT^ 5.88.0. 

GCCATTTAAG TCTTTTGCCC ATTTTTTCCC CCACATAGCT TCTCTTATCA GATATATGAC 5940 

TTGCAATATT TATTTCATTT CX3GGGTTGAT TGCTTTTTCA CTCTGATTGT GCCCTTTGAT 6000 

GCATAGATGT TTTGAATTTT CATCAGTCTA CTTTGTCAGT TCTTTCTATT CTATCTGTGC 6060 

TTTGGTGTCA TATCCATGAA AGCACTGTCA AATCCTATGT CT^TGAACATT ATCCCCAATG 6120 

TTTGCTTCTA AGAAATTTTT AGGTTTTAGT TCTTGAGTGT AGAGTTTAGG TCTTTGATTC 6180 

ATTTTGAGTT AATTTTTGTA TATAGTGCAA ATTAAGGGTC CAATTTTATT TTAACACCCC 6240 

CTGCCCCCAG AACTATTTGC TGAAAAGATC AACTGACTCT TTGTCACCTG CTCACCCCAG 6300 

TGGACACTAG CTGTTCCATC CAATTGCTGT CCTGGGGCCT TGTCATGCTA CTCTTCCACT 6360 

TTGAACCCAA GCCCACACCG TTCGTTGCTC CCCTCTGGGA TACTGACCCC ACTATAAACT 6420 

TCTCTGGGGC TACAACCTTC CTACCCTTTG TGCCTCATGA CCACCCCCTC CCTTGTCCCC 6480 

GCCATGCCCA TGATGAGTCT CTTCTCGAGG CAGCTCCCCT TGCCTCCATC TCACCCTCAG 6540 

CCTATGCACC ACAGCCACAC TGGACATGGG TCCCTCTGAG CCTGAGTCCC TTCCCATTCC 6600 

CACCATCTCC TCTGGCAAGA CCTTCCTTCC ACCACCTTCA TGCTCCTCCC TTGCCCCTGC 6660 

AGGGCAGCCT CTCCCCTTGG CCCCTATTCC CTTAGGGGGC TTGTGOCCAC CCAGTCCTTG 6720 

CACCTGGCCT ACAAGTTTGC CATCTTCATT CCCCCTTCTT CTGTTCATCA GCCCCCTCCT 6780 

CTATCCTCCC ACCCTCACAG TTTTCTTTGT ATATGAAATC CTCGTTCTTG TCCCTTTGCC 6840 

CGTGTGCATT TCCTGCCCCA GGAAGGTTGG GACAGCAGAC CTGTGTGTTA AACATCAATG 6900 

TGAAGTTACT TCCAGGAAGA AGTTTCACCT GTGATTTCCT CTTCCXrCAGA GCCCCACAGT 6960 

CTTCGTTATA ACCTCACX3GT GCTGTCCTGG GATGGATCTG TGCAGTCAGG GTTTCTTGCT 7020 

GAGGTACATC TGGATGGTCA GCCCTTCCTG CGCTATGACA GGCAGAAATG CAGCGCAAAG 7080 

CCCCAGGGAC AGTGGGCAGA AGATGTCCTG GGAAATAAGA CATGGGACAG AGAGACCAGG 7140 

GACTTGACAG GGAACGGAAA GGACCTCAGG ATGACCCTGG CTCATATCAA GGACCAGAAA 7200 
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GAAGGTGAGA GTCGGCAGGG GCAAGAGTGA CTGGAGAGGC CTTTTCCAGA AAAGTTAGGG 7260 

GCAGAGAGCA GGGACCTGTC TCTTCCCACT GGATCTGGCT CAGGCTGGGG GTGAGGAATG 7320 

GGGGTCAGTG GAACTCAGCA GGGAGGTGAG CCGGCACTCA GCCCACACAG GGAGGCATGG 7380 

GGGAGGGCCA GGGAGGCGTA CCCCCTGG6C TGAGTTCCTC ACTTGGGTGG AAAGGTGATG 7440 

GGTTCGGGAA TGGAGAAGTC ACTGCTGGGT GGGGGCAGGC TTGCATTCCC TCCAGGAGAT 7500 

TAGGGTCTGT GAGATCCATG AAGACAACAG CACCAGGAGC TCCCAGCATT TCTACTACGA 7560 

TGGGGAGCTC TTCCTCTCCC AAAACGTGGA 6ACTGAGGAA, TGGACAGTGC .CCCAGTCCTC 7620 _ 

CAGAGCTCAG ACCTTGGCCA TGAACGTCAG GAATTTCTTG AA6GAAGATG CCAT6AAGAC 7680 

CAAGACACAC TATCACGCTA TGCATGCAGA CTGCCTGCAG GAACTACGGC GATATCTAGA 7740 

ATCCAGCGTA GTCCTGAGGA GAAGAGGTAC GGACGCTGGC CAGGGGCTCT CCTCTCCCTC 7800 

CAATTCTGCT AGAGTTGCCT CACCTCCAAG ATGTGTCCAG GGAAACCCTC CCTGTGCTAT 7860 

GGATGAAGGC ATTTCCTGTT GGCACATCGT GTCCTGATTT TCCTCTATTG TTAGAGCCAC 7920 

TGGATAAAGA CAGTGGGTCA GGGACTGGAC CATCCAGTGT TGTAATCAGG GCAAGTAGAG 7980 

GACCCTCCGA CAGAATCCTG AGCCTGTGGT GGGTGTCAGG CAGGAGAGGA AGCCTTCAGG 8040 

GCCTIGGGCTG CCCCCTCTGC CTCCCAGCCT GCCCATCCTG GAGAGTTCCC TCCTGGCCCC 8100 

ACAACCCAGG AGTCCACCCC TGACATCCCC CTCCTCA6CA TCAATGTGGG GATCCCAGAG 8160 

CCTGAGGCCA CAGTCCCAAG GCCCATCCTG CTGCCAGCCT GGAAGAACTG GGCCCCAGAG 8220 

TGAGGACAGA CTTGCAGGTC AGGGGTCCCG GAGGGCTTCA GCCAGAGTGA GAACAGTGAA 8280 

GAGAAACAGC CCTGTTCCTC TCCCCTCCTT AGAGGGGAGC AGGGCTTCAC TGGCTCTGCC 8340 

CTTTCTTCTC CAGTGCCCCC CATGGTGAAT GTCACCCGCA 6CGAGGCCTC AGAGGGCAAC 8400 

ATCACCGTGA CATGCAGGGC TTCCAGCTTC TATCCCCGGA ATATCACACT GACCTGGCGT 8460 

CAGGATGGGG TATCTTTGAG CCACGACACC CAGCAGTGGG GGGATGTCCT GCCTGATGGG 8520 

AATGGAACCT ACCAGACCTG GGTGGCCACC AGGATTTGCC AAGGAGAGGA GCAGAGGTTC 8580 

ACCTGCTACA TGGAACACAG CGGGAATCAC AGCACTCACC CTGTGCCCTC TGGTGAGCCT 8640 

AGGGTGACCC TGGAGAGGGT CAGGCCAGGG TAGGGACAGC AGGGATGGCT GTGGCTCTCT 8700 

GCCCAGTGTA TAACAAGTCC CTTTTTTTCA GGGAAAGTGC TGGTGCTTCA GAGTCATTGG 8760 

CAGACATTCC ATGTTTCTGC TGTTGCTGCT GCTGCTGCTG CTATTTTTGT TATTATTATT 8820 

TTCTATGTCC GTTGTTGTAA GAAGAAAACA TCAGCTGCAG AGGGTCCAGG TGAGAAAAGC 8880 

GGGCAGTTTC TGGAGATGGT AAGGCCCCTG TCTGGGCAGT AGGGTCCCCT CATTGCTCCT 8940 
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GCAAAGATAG GCATGTTGGT GACAAGGCTT CTGTAACAGG GGATGAAAGT TGGGGAATTT 9000 
GGGAAGGGAA TGGGGGCAGC ATCTCCATCT ACACCCATAA GTGCTGCCCA AGCGAGGGTC 9060 
AAACGCCCAG CTGTGGCATC TTCCTGCTGC AGGTGAGGAG TGGGCAGCAG GGAGGGCTGC 9120 
GGCGCCTGCT CTGTCCCCAT CCCGGTCTCT GTGTCTCTTG GACTCACTAG GGCGCATCCA 9180 
GGTGGGGTGA GCTGGGAATC ACGTGCTGAA TGCTGAGGGC CTGGATGATC ACGGCCTCAG 9240 
AGGGAGCAAA TAGTAAAGGC AGCTGTGATC TGGGGAGGGC CAGAAACTGG AGAGGAATCT 9300 

GAGGAGAGGC GGTGCCCCTA. .TTCCCTTCCT CTCTGCATCC CCCTCCCCTG_TTTCTCCAGC 93 6 0 

CATCGGGGCG GACACCGAGA AAAAGACCTA TGAGGCCCAG CCTGGGGGCC CTGCCTGTGT 9420 
AGCCCTTTGG AGACCCCTAG TAACAGGGAG GGTCCTGAGC ACACATGGCC ATCTCTGTCC 94 80 
ACTGTGCAGC TCCCCATGCA CCTCCTCCAG GAGCTTTCTT GGGGTTGTCG TGTCCTCTGC 9540 
ACCATTCGAG GCCCTACTCT TTCCAGGTTC CCACGGCCTG GCCTCCCTGA GTTTCTTGCA 9600 
GATGACATGG ATGAGTAGAT AAGCAGATGT CCCTGGGCCA TTTGAGGAGT GGGGCCCAGC 9660 
CCCTCATCAG GGCAGCTGTG GTCCCTGTTT TCATCCTACC TCCGAGTGTT TTCTTCTCCA 9720 
GTCCCTGAGG 6ACACAGTCC TCAGGGCCCA TGTTTTTGGG GATTTAATCT GTGCTCTGTG 9780 
GCCTCACCTT GCCTTCCCTG AGCCAATTTC CCTTTCTAAA GGTGGTCACT GCCTGGTAAG 9840 
TTTGGAGTAA GGGACGGTCA GAATCATTTC CCCTACAGTC AGGTTGTTTG ATGGGGGATG 9900 
AAAAGAGACA GCAGGAAGTT TTGTGTTTCT GCAAAGACAG AAGCAGTTCA GGCGACAGTA 9960 

AGAGGCTGGG GTGTCCAGGA GGGTGTGTCT GGCAGTAGGG TCGCTGGTTT CTCATCCTTG 10020 

AACCTAATTG CACTGTCAGT CGGCCCCTCA GGCCTGAGCA GATGGGAAGG TTTGTCCCCT 10080 

GCCCTGCAGC AAGAGGGCCC TGTCCAGGAG GCACCCACAA CAGAGGCAGT GCAGGTCTGT 10140 

GGTCACTCCT ACTCTCACCT GTGGCGTCTC CCGTAGAGGG ATTGTCAGTT CTGGTTCCCT 10200 

GTGGGCAGGA ATGGTTTCCT CATAGGTCAC TGGAGTTTTG GCCAGGAAAA 6AGTATGAAG 10260 

TTCATGTGGC AGTTTCTCAA AATTCCTGCT TTCAATGTTG ATGTCCAGTA AAGATATTCG 10320 

TAATTTCAGC TCTATAATCT TAATAGGATT TCCTCTAATA TT6TGAAGCA TATTATATGA 10380 

AACAGGAACA CAAATTTCTC AAAATTCCTG CGATGTCCAA TAAAGATTTT CATAATTTCA 10440 

GCTCTGCAAT CTTAATAGGA TTTCCTAATA CTGTAAAGCA TATTAAATGA AACAGGAACT 10500 

CAAATTTGGA GCCCCCTCTC CAGGAGGTTC TGTGTGGAGA TGGTGGCTGT GGCAGTGGCA 10560 

GTTCCCAGGT GCAGAGGGTG GGCAGAGGCA GCCTCAGGCT AAGGGGTCTC CCCTACTCCA 10620 

CATGGAGAAA ATCCCTTGTA GGTTGCAAGG GCAGTGGCCG GGTGGAATCC CTGCTAGGGA 10680 
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CAGAGCAGGA AGGCCTCX3CA GCCTCACCAA GCAGCAGCCC TGGGGTGGAG CTGCGTTTCC 
AGGGTTAAGC GGACCAGGCA GGAGTAGCGG TTACTCAAGA GCAGGTCACA GGCTTGGGTT 
GTGAGGGTCA GGAGAGGCCA GGCCTCCTCG AGCAAGGTGG GGGTCCCAGG GTCAGGTCAG 
GTGCAGATCC TGTGGCAGCC ACX5TCTTTCC ATGCTGGGCC TGCTGGGCCC CCCAGGCTTC 
CTGATGGGGT CCCCAGTTAG GAGCTGCCTG CTCAGGGCTG GGAGGGGAGG AGCACTGAGC 
TGCAGATAGA GGGCAGAGCC CACAGTGGGC AGGGCCTGCC CTGGTGTGTA GGTGCCTCTG 
CAGGAGAGGA GGGCCTGGGG. ACTGAGAGCA^ AGGGTCAGGG-CCTCTCTTTG-GGGA 
TCACTGTAAC AG6ACTGGTC AGGCCTGAGA GGAGGGCACT GGGTTCCCTC TTGGGTCTTG 
TCCTTTAGTC TTGGGGCCCT TTCCCTCCCT GCACGATGAG TGGTGGGCAC AGGGCACGGG 
CTGATGTTGA TGGAGTGATG GGAGGGAACT GGCAGGGGCT GGGAAAAGCA AGGAGGGAGG 
AAGAAAAAAG TGGGGGCCTC ATCTTCCCTC AGAGAAAGGG CAAATCTGGT TTTGGAGCAA 
CTGAAGAGAG AAAAGTCCCC AGGGAATAAA CACAACACTG CACCCAGTGG AGCATTTACC 
CATTTCCCTC TTTTCTCCAG AGCTCGTGAG CCTGCAGGTC CTGGATCAAC ACCCAGTTGG 
GAC6AGTGAC CACAGGGATG CCACACAGCT CGGATTTCAG CCTCTGATGT CAGCTCTTGG 
GTCCACTGGG TCCACTGAGG GCGCCTAGAC TCTACAGCCA GGCGGCTGGA ATTGAATTCC 
CTGCCTGGAT CTCACAAGCA CTTTCCCTCT TGGTGCCTCA GTTTCCTGAC CTATGAAACA 
GAGAAAATAA AAGCACTTAT TTATTGTTGT TGGAGGCTGC AAAATGTTAG TAGATATGAG 
GCATTTGCAG CTGTGCCATA TT 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 385 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Gly Leu Gly Pro Val Phe Leu Leu Leu Ala Gly lie Phe Pro Phe 
15 10 15 

Ala Pro Pro Gly Ala Ala Ala Glu Pro His Ser Leu Arg Tyr Asn Leu 
20 25 30 

Thr Val Leu Ser Trp Asp Gly Ser Val Gin Ser Gly Phe Leu Ala Glu 
35 40 45 
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Val His Leu Asp Gly Gin Pro Phe Leu Arg Tyr Asp Arg Gin Lys Cys 

50 .55 60 

Arg Ala Lys Pro Gin Gly Gin Trp Ala Glu Asp Val Leu Gly Asn Lys 
65 70 75 80 

Thr Trp Asp Arg Glu Thr Arg Asp Leu Thr Gly Asn Gly Lys Asp Leu 
85 90 95 

Arg Met Thr Leu Ala His lie Lys Asp Gin Lys Glu Gly Leu His Ser 
100 105 110 

Leu Gin Glu lie Arg Val Cys Glu lie His Glu Asp Asn Ser Thr Arg 
115 . 120 , 125 

Ser Ser Gin His Phe Tyr Tyr Asp Gly Glu Leu Phe Leu Ser Gin Asn 
130 135 140 

Val Glu Thr Glu Glu Trp Thr Val Pro Gin Ser Ser Arg Ala Gin Thr 
145 150 155 160 

Leu Ala Met Asn Val Arg Asn Phe Leu Lys Glu Asp Ala Met Lys Thr 
165 170 175 

Lys Thr His Tyr His Ala Met His Ala Asp Cys Leu Gin Glu Leu Arg 
180 185 190 

Arg Tyr Leu Glu Ser Ser Val Val Leu Arg Arg Arg Val Pro Pro Met 
195 200 205 

Val Asn Val Thr Arg Ser Glu Ala Ser Glu Gly Asn He Thr Val Thr 
210 215 220 

Cys Arg Ala Ser Ser Phe Tyr Pro Arg Asn He Thr Leu Thr Trp Arg 
225 230 235 240 

Gin Asp Gly Val Ser Leu Ser His Asp Thr Gin Gin Trp Gly Asp Val 
245 250 255 

Leu Pro Asp Gly Asn Gly Thr Tyr Gin Thr Trp Val Ala Thr Arg He 
260 265 270 

Cys Gin Gly Glu Glu Gin Arg Phe Thr Cys Tyr Met Glu His Ser Gly 
275 280 285 

Asn His Ser Thr His Pro Val Pro Ser Gly Lys Val Leu Val Leu Gin 
290 295 300 

Ser His Trp Gin Thr Phe His Val Ser Ala Val Ala Ala Ala Ala Ala 
305 310 315 320 

Ala He Phe Val He He He Phe Tyr Val Arg Cys Cys Lys Lys Lys 
325 330 335 



Thr Ser Ala Ala Glu Gly Pro Glu Leu Val Ser Leu Gin Val Leu Asp 
340 345 350 
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Gin His Pro Val Gly Thr Ser Asp His Arg Asp Ala Thr Gin Leu Gly 
355 360 365 

Phe Gin Pro Leu Met Ser Ala Leu Gly Ser Thr Gly Ser Thr Glu Gly 
370 375 380 

Ala 
385 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2380 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GGGCCATGGG GCTGGGCCGG GTCCTGCTGT TTCTGGCCGT CGCCTTCCCT TTTGCACCCC 
CGGCAGCCGC CGCTGAGCCC CACAGTCTTC GTTACAACCT CATGGTGCTG TCCCAGGATG 
AATCTGTGCA GTCAGGGTTT CTCGCTGAGG GACATCTGGA TGGTCAGCCC TTCCTGCGCT 
ATGACAGGCA G7VAACGCAGG GCAAAGCCCC AGGGACAGTG GGCAGAAGAT GTCCTGGGAG 
CTAAGACCTG GGACACAGAG ACCGAGGACT TGACAGAGAA TGGGCAAGAC CTCAGGAGGA 
CCCTGACTCA TATCAAGGAC CAGAAAGGAG GCTTGCATTC CCTCCAGGAG ATTAGGGTCT 
GTGAGATCCA TGAAGACAGC AGCACCAGGG GCTCCCGGCA TTTCTACTAC GATGGGGAGC 
TCTTCCTCTC CCAAAACCTG GAGACTCAAG AATCGACAGT GCCCCAGTCC TCCAGAGCTC 
AGACCTTGGC TATGAACGTC ACAAATTTCT GG/^GGAAGA TGCCATGAAG ACCAAGACAC 
ACTATCGCGC TATGCAGGCA GACTGCCTGC AGAAACTACA GCGATATCTG AAATCCGGGG 
TGGCCATCAG GAGAACAGTG CCCCCCATGG TGAATGTCAC CTGCAGCGAG GTCTCAGAGG 
GCAACATCAC CGTGACATGC AGGGCTTCCA GCTTCTATCC CCGGAATATC ACACTGACCT 
GGCGTCAGGA TGGGGTATCT TTGAGCCACA ACACCCAGCA 6TGGGGGGAT GTCCTGCCTG 
ATGGGAATGG AACCTACCAG ACCTGGGTGG CCACCAGGAT TCGCCAAGGA GAGGAGCAGA 
GGTTCACCTG CTACATGGAA CACAGCGGGA ATCACGGCAC TCACCCTGTG CCCTCTGGGA 
AGGTGCTGGT GCTTCAGAGT CAACGGACAG ACTTTCCATA TGTTTCTGCT GCTATGCCAT 
GTTTTGTTAT TATTATTATT CTCTGTGTCC CTTGTTGCAA GAA6AAAACA TCAGCGGCAG 
AGGGTCCAGA GCTTGTGAGC CTGCAGGTCC TGGATCAACA CCCAGTTGGG ACAGGAGACC 
ACAGGGATGC AGCACAGCTG GGATTTCAGC CTCTGATGTC AGCTACTGGG TCCACTGGTT 
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CCACTGAGGG CGCCTAGACT CTACAGCCAG GCGGCCAGGA TTCAACTCCC TGCCTGGATC 1200 

TCACCAGCAC TTTCCCTCTG TTTCCTGACC TATGAAACAG AAAATAACAT CACTTATTTA 1260 

TTGTTGTTGG ATGCTGCAAA GTGTTAGTAG GTATGAGGTG TTTGCTGCTC TGCCACGTAG 1320 

AGAGCCAGCA AAGGGATCAT 6ACCAACTCA ACATTCCATT GGAGGCTATA TGATCAAACA 1380 

GCAAATTGTT TATCATGAAT GCTIGGATGTG GGCAAACTCA CX^ACTGCTCC TGCCAACAGA 1440 

AGGTTTGCTG AGGGCATTCA CTCCATGGTG CTCATTGGAG TTATCTACTG GGTCATCTAG 1500 

AGCCTATTGT TTGAGGAATG CAGTCTTACA AGCCTACTCT GGACCCAGCA GCTGACTCCT 1560 

TCTTCCACCC CTCTTCTTGC TATCTCCTAT ACCAATAAAT AC6AAGGGCT GTGGAAGATC 1620 

AGAGCCCTTG TTCACGAGAA GCAAGAAGCC CCCTGACCCC TTGTTCCAAA TATACTCTTT 1680 

TGTCTTTCTC TTTATTCCCA CGTTCGCCCT TTGTTCAGTC CAATACAGGG TTGTGGGGCC 1740 

CTTAACAGTG CCATATTAAT TGGTATCATT ATTTCTGTTG TTTTTGTTTT TGTTTTTGTT 1800 

TTTGTTTTTG AGACAGAGTC TCACTCGTCA CCCAGGCTGC AGTTCACTGG TGTGATCTCA 1860 

GCTCACTGCA ACCTCTGCCT CCCAGGTTCA A6CACTTCTC GTACCTCAGA CTCCCGATAG 1920 

CTGGGATTAC AGACAGGCAC CACCACACCC AGCTAATTTT TGTATTTTTT GTAGAGACGG 1980 

GGTTTCGCCA AGTTGACCAG CCCAGTTTCA AACTCCTGAC CTCAGGTGAT CTGCCTGCCT 2040 

TGGCATCCCA AAGTGCTGGG ATTACAAGAA TGAGCCACCX3 TGCCTGGCCT ATTTTATTAT 2100 

ATTGTAATAT ATTTTATTAT ATTAGCCACC ATGCCTGTCC TATTTTCTTA TGTTTTAATA 2160 

TATTTTAATA TATTACATGT GCAGTAATTA GATTATCATG GGTGAACTTT ATGAGTGAGT 2220 

ATCTTGGTGA TGACTCCTCC TGACCAGCCC AGGACCAGCT TTCTTGTCTIC CTTGAGGTCC 2280 

CCTCGCCCCX3 TCACACCGTT ATCGATTACT CTGTGTCTAC TATTATGTGT GCATAATTTA 234 0 

TACCGTAAAT GTTTACTCTT TAAATAAAAA AAAAAAAAAA 2380 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 383 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Gly Leu Gly Arg Val Leu Leu Phe Leu Ala Val Ala Phe Pro Phe 
15 10 15 

Ala Pro Pro Ala Ala Ala Ala Glu Pro His Ser Leu Arg Tyr Asn Leu 
20 25 30 
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Met Val Leu Ser Gin Asp Glu Ser Val Gin Ser Gly Phe Leu Ala Glu 
35 40 45 

Gly His Leu Asp Gly Gin Pro Phe Leu Arg Tyr Asp Arg Gin Lys Arg 
50 55 60 

Arg Ala Lys Pro Gin Gly Gin Trp Ala Glu Asp Val Leu Gly Ala Lys 
65 70 75 80 

Thr Trp Asp Thr Glu Thr Glu Asp Leu Thr Glu Asn Gly Gin Asp Leu 
85 90 95 

Arg Arg Thr Leu Thr His lie Lys Asp Gin Lys Gly Gly Leu His Ser 
100 105 110 

Leu Gin Glu lie Arg Val Cys Glu lie His Glu Asp Ser Ser Thr Arg 
115 120 125 

Gly Ser Arg His Phe Tyr Tyr Asp Gly Glu Leu Phe Leu Ser Gin Asn 
130 135 140 

Leu Glu Thr Gin Glu Ser Thr Val Pro Gin Ser Ser Arg Ala Gin Thr 
145 150 155 160 

Leu Ala Met Asn Val Thr Asn Phe Trp Lys Glu Asp Ala Met Lys Thr 
165 170 175 

Lys Thr His Tyr Arg Ala Met Gin Ala Asp Cys Leu Gin Lys Leu Gin 
180 185 190 

Arg Tyr Leu Lys Ser Gly Val Ala lie Arg Arg Thr Val Pro Pro Met 
195 200 205 

Val Asn Val Thr Cys Ser Glu Val Ser Glu Gly Asn lie Thr Val Thr 
210 215 220 

Cys Arg Ala Ser Ser Phe Tyr Pro Arg Asn He Thr Leu Thr Trp Arg 
225 230 235 240 



Gin Asp Gly Val Ser Leu Ser His Asn Thr Gin Gin Trp Gly Asp Val 
245 250 255 

Leu Pro Asp Gly Asn Gly Thr Tyr Gin Thr Trp Val Ala Thr Arg lie 
260 265 270 

Arg Gin Gly Glu Glu Gin Arg Phe Thr Cys Tyr Met Glu His Ser Gly 
275 280 285 

Asn His Gly Thr His Pro Val Pro Ser Gly Lys Val Leu Val Leu Gin 
290 295 300 



Ser Gin Arg Thr Asp Phe Pro Tyr Val Ser Ala Ala Met Pro Cys Phe 
305 310 315 320 

Val He He He He I^eu Cys Val Pro Cys Cys Lys Lys Lys Thr Ser 
325 330 335 
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Ala Ala Glu Gly Pro Glu Leu Val Ser Leu Gin Val Leu Asp Gin His 
340 345 350 

Pro Val Gly Thr Gly Asp His Arg Asp Ala Ala Gin Leu Gly Phe Gin 
355 360 365 

Pro Leu Met Ser Ala Thr Gly Ser Thr Gly Ser Thr Glu Gly Ala 
370 375 380 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
ACTGGGGAAC AAGGTTTATA TGAGA 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TGTCACCCGT CTTCTACAGG ACCC 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQXJENCE DESCRIPTION: SEQ ID NO: 7: 
GGGGCCATGG GGCTGGG 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
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ATCTGAGATG TCGGTCC 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CGTTCTTGTC CCTTTGCCCG TGTGC 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AACCCTTCCC TTACCCCCGT CGTAG 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TATGTAAAAC GACGGCCAGT TTCACCTGTG ATTTCCTCTT CCCCA 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGTCTTTTCA ATCCCCGTCT CTCGTCCAGT ATCGACAAAG GACAT 



(2) INFORMATION FOR SEQ ID NO: 13: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TATGTAAAAC GACGGCCAGT TTCGGGAATG GAGAAGTCAC 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CGAGAGGAGA GGGAGGTTAA CCAGTATCGA CAAAGGACAT 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TATGTAAAAC GACGGCCAGT GTTCCTCTCC CCTCCTTAGA 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
AAAAAGTCCC TTTCACGACC ACCAGTATCG ACAAAGGACA T 



